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1 . INTRODUCTION 



The purpose of this study is to investigate a use of time 
series in improving weather forecasting. 

Let us consider a simple model. Let us assume that the 
relevant variables, or functions, such as, say, the 500 mb height 
field, can be expressed in terms of a set of orthogonal basis 
functions, whose coefficients define the field. There is also an 
associated numerical integration program which yields predictions 
of this variable in the form of predicted coefficients. This method 
of forecasting is assumed to be more reliable than any regression 
scheme. At regular intervals some observations of the variable 
are made and the predicted coefficients are then updated, or 
corrected . 

These corrections to the coefficients in the forecast are 
analyzed for trends. The corrections for each (complex) coeffi- 
cient of a (complex) eigenfunction or basis function are analyzed 
separately. The purpose is to predict the corrections. 

The basic approach is to define a best-fitting difference 
equation and a best set of initial conditions, by minimizing 
weighted sums of squares of the residuals, and then use the 
difference equatio.n and the initial values to predict the correc- 
tion. These are used in a secondary routine to improve the forecast. 

Testing of the method in the 500 mb field with a simple 
non-divergent model due to Bourke (1972) shows skill with some 



coef fi ci ents . 



2. OUTLINE OF THE PROBLEM 

Let us assume that we have some function, or functions, which 
we are representing in terms of an orthonormal set of basis 
functions. In particular, let us consider the 500 mb height 
function, or perhaps, the stream function, expressed on the surface 
of the earth in terms of spherical harmonics. These are expressed 
as complex trigonometric functions of longitude and Legendre 
functions of the sine of the latitude. At time tj^ the coefficients 
of the spherical harmonics form a vector or column matrix Aj^; its 
components are complex numbers aj^ , k = l , ..., K, if the series 
is truncated to K terms. The vectors A^^, and each of the components 
N’ 3 time series, or sequence. 

The sequence is generated conceptually as follows. Let us 
assume that at time tj^_^ we have the matrix Aj^_i of coefficients 
for the analyzed field. This is considered to represent the true 
field at that time. We have also an integration procedure by which 
we can generate predictions Aj^_i , Aj^_.| fj + i > •••> for the future 
values A|^, Aj^^^ , ..., as many as we feel are useful or reliable. 

Then at time tj^, 12 hours later, observations are made. These are 
combined with the predictions toyield a new analyzed field, 

and a new set of coefficients Aj^ . The corrections (changes, 
errors, or discrepancies), 

Z„ = A„ - A„ , , n = 2, 3, . . . , N, (2.1 ) 

n n n- I , n 

also define a time series or a sequence. If we could in some way 

analyze this sequence, Z 2 , Z^, .., Zj^ , and from it predict 

^N + 1 ’ ^N + 2’ ■■■’ ^N+m then we could predict ahead m steps 
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reliably. Our purpose is to predict more accurately, or more 
reliably, and perhaps to extend the range of prediction. We shall 
see that there are two ways, at least, that this improvement might 
be effected. Both depend on the use of time series as developed 
in the next sections . 

The Z's and/or their components form the data for the analysis. 

As such they are called observations (not to be confused with 
meteorological observations). The terms component and coefficient 
are used interchangeably. Vectors have components and basis 
functions have coefficients when they are used in the representation 
of another function. The components aj^ ^ of A^, for example, are 
the coefficients in the representation of some meteorological 
variable. Basis functions are often called modes, particularly in 
mechanics . 

3. INTRODUCTORY TIME SERIES, SIMPLIFYING ASSUMPTIONS 

Let us consider the sequence of corrections, Z^, n = 2, 3, ..., N. 

The vector Z^ has K complex components. There is one complex number 

z. for each of the K compl e x- val ued spherical harmonics that is 
K 5 n 

used. To save writing, let z^, rather than denote a typical 

coefficient; we will analyze each of these sequences individually. 

Part of the rationale is that in a linear uncoupled system the 
various modes move independently; in a loosely coupled system we 
would expect similar behavior. There are other important reasons 
which we will take up later. We shall usually consider z^ to be a 
pair of real numbers, a two-by-one matrix. 
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Let us now assume that can be expressed as a sum of two 
terms, described below. 



X + y . 
n n 



(3.1 ) 



Here is a random variable. Its components each are assumed to 

2 

have expected value zero and expected square a , independent of n; 
they are also assumed to be uncorrelated. The vector x^ is assumed 
to be a solution to a difference equation of the form 

^n " ^l^n-1 ••• ^p^n-p’ n = p + 2, ..., N. (3.2) 

The coefficients C-j Cp are unknown two-by-two matrices, and 

the order p is to be chosen some way. For the present let us take 
p = 2, which seems at this time to be a likely value. The C's are 
not constants necessarily, but may vary adaptively with N. 

To determine the coefficients in (3.2), let us choose the C's 
to minimize a residual of the form 



1 N p 2 

R=yZw(z -5:C.z .) , (3.3) 

2 ^ n n 1 J •I'J 

where the w^ are positive weight factors to be chosen or determined; 
we will discuss this later. To minimize R let us differentiate with 
respect to the elements of the C's; we get a system of equations of 
the form 



N 

E w 

n 



An-1 ^n-1 ’ 
Vn-p^n-1 ’ 



z , z ' 
n- 1 n- 



z 1 
n- p n- 





(3.4) 
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the prime (') Indicates the transpose of a matrix. The terms like 
^n-l^n-p two-by-two matrices so that the elements of the 

matrices are themselves matrices of order two. 

Another problem is the choice of the weight factors w^. For 
a ti me- i ndependent process we should use equal weightings, which 
simplifies several relations. However the fundamental relations 
may vary rather rapidly with time sometimes, and these are the very 
times that are most critical. One way to take account of this is 
to use exponential weightings in (3.3) and (3.4). After some 
starting routine we weight each new observation with a uniform 
value w and decrease all earlier weightings then by a factor 1 -w . 

We do this as follows. Let be the matrix of coefficients in 
(3.4); it has two-by-two block elements of the form 



^i , j .N 



N 

z 

p + 2 



w z . z ' . , 

n n-1 n- j 



i ,j=l , 



p; 



(3.5) 



the right-hand side of (3.4) has elements Q^. q . When we get the 
observation Zj^, we update Q^. ^ by the relation 

^i,j,N “ ^i,j,N-l ^ '^^^N-i^N-j ~ ^i,j,N-l^‘ (3.6) 

We may actually update Q by first shifting each block element 
down and to the right one place, dropping out the last row and the 
last column. Then we update the first (block) row, using (3.6), 
and then the first column by symmetry. The weight of an observa- 
tion from time t for the coefficients calculated at time t.. is 

n H 

Nl n 

w(l-w) ~ ; we see that weightings decrease exponentially with time 
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The best choice of w is another problem. If w is too small 
the system is slow to respond to changes, and if w is too large, 
random errors cause excessive errors in the predicted values. 

The above procedure for updating the coefficients in (3.2) is 
rather efficient, both in terms of storage and computation. The 
matrix Q is always singular until at least 2p obser vat ions have 
been made. It is usually positive definite after that, but it was 
sometimes found to be near singular, especially for the smaller 
values of N . 

In order to predict the z's ahead, using (3.2), we also need p 
starting or initial values; we can take these conceptional ly as 
estimates of Xj^, x^ .j , ..., Xj^ p+1 ‘ determination of these is 

taken up in the next section; it requires more storage and more 
computation than the above. 

In the following section we will generally consider the case 
P = 2. 

There may be some minor discrepancies in the first index; for 
example, when the a's are involved we tend to think of as the 
first of the z's, and later, when treating the z's we think of the 
first as . 
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4. ROUTINE TO DETERMINE THE BEST FILTERED OR INITIAL VALUES 
FOR PREDICTION 

Let us assume that we have found the coefficients C. in the 

J 

difference Eq . (3.2). We also need estimates of and -j, say 
Xn» ^ order to predict ahead using the difference equation. 

We might just use the last two observations, Zj^, -j , but these 
involve individual random errors, and we expect smoothed or 
filtered values to be more reliable. By using a large value for 
the weighting parameter w we can drive the smoothed values close 
to the last two observations, if we wish. 

Let us resolve the problem as follows. Let us consider 
yn = - Xn (n = 1, ..., N) 

" ^n " ^^l^n-1 ^2^n-2^ ^ 

= z^ - (C-|[C^x ^_2 + ^2X^,3] + C2[C-|X|^_3 + C2X^_^]) 



which we may rewrite in the form 

J'n ' ^ - ^''2 - ^’'l ■ 

A and B are defined by this relation, 
n n 

they satisfy the relations 



( n = 1 , . . . , N ) ; (4.1) 

It is easily verified that 



C,A 1 + C„A^ , 
1 n-1 2 n-2 

0, A 2 = I, 



C,B , + C„B^ „ 
1 2ri"'2 



I, B2 = 0. 



n = 1,2, ..., N, 



(4.2) 
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We can simplify the programming somewhat if we define the 
matrices 



D" = (A„,B„) 

U ~ ^ ^ i ~ ^^12* ^22* ^11* ^21^ * 



(4.3) 



X.. is the i'th component of x.. Then d’’ satisfies Eq . (3.2) and 
J J 

has the initial values 



= (0, I) , = (I , 0) . 



(4.4) 



Equation (4.1) now reduces to 



z - D U. 
n 



(4.5) 



If we knew the four components of U, we could get starting values 
’ ^N-1 • 



^n ■ ^l^n-1 *'2^n-2’ 



(4.6) 



and if we select or estimate U in some way we get corresponding 
estimates for x^j , course it would be nicer to determine 

these in a more direct way, but there seems to be no way to do 

this . 

Now, by assumption, the expected value, E y^. = 0. Hence it 
seems reasonable that the best choice for U is one which minimizes 

a weighted sum of squares for the estimates of y. , say. 



, N ? 1 N „ 2 

R=^Ewy=^Ew (d"u - z ) 

2 n-'n 2 n n 



(4.7) 



To effect this we differentiate with respect to the components of 
U; if we call its components u., the j'th partial derivative yields 

^ ■ ^l,n^ *^2,k^k ' ^2,n^^ 



0 , 
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(4.8) 



0 n 

where are the components D . We solve these equations for U, 

/V 

which defines and from these we get our best estimates, 

starting. 

There are several problems. First, the sequence z^, 
n = 1, 2 ..., N, may be very long, so that computing times and 
errors may be significant. Further the observations for small 
values of n may not be relevant to today's weather. We also need 
to recall many observations. These problems suggest limiting the 
range of summation in (4.7) to, say, the last ten terms, which we 
did. 

Another problem needs some explanation. We really just need 
1 ’ predict ahead. However we cannot get these directly. 

It really does not matter whether we find X|^_-| and Xj^, or 
and we get the former from the latter by use of difference 

Eq . (3.2). We might perhaps try to rewrite the difference 
equation, to solve for x ^ in terms of x , and x„, to avoid 
this. However we must invert the matrix to effect this, and 
there is no reason that it cannot be singular, or nearly so, 
r andoml y . 

In the program we have 15 x's actually indexed from 1 to 15. 
The first 10 of these are the 10 approximants to the last 10 
observations Zj^ g, ..., Zj^; the last five are predictions for the 
future values . •••. ^^+ 5 - 
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X > 



5. TWO SUGGESTED METHODS FOR IMPROVING PREDICTION BY THE USE OF 
A TIME SERIES 

There are two particular methods which seem feasible for 
improving the forecast. Each of these seems feasible, but each 
poses some problems. 

5 .1 Method 1 

Let us consider again the matrices for the coefficients in the 
analyzed fields, A^, n=l , ..., N; we wish to improve the predictions 
of these. Let us assume that we have generated and stored the 
following. First, we have stored the last ten corrections 

, ..., Z|^. We have also stored, for each term in the Z's, 
the matrix Q which defines the coefficients C. in the difference 
equation. 

The routine is the following for each component of Z. As 
soon as the new value of Z, Zj^, is obtained, we store it and 
discard the old Then we update the matrix Q and solve for 

the coefficients of the difference equation C. as described in 
Section 3. Then we solve for the initial values X|^_i ♦ as 

described in Sect. 4, and finally we predict ahead to get 

^N+1 ’ ^N+2’ ***’ ^N+M’ many as we wish. These define the 

/\ 

' s , or X ' s . 

Next let us use the dynamic equation and integrate ahead one 
time step, to get an estimate Aj^ n+ 1 ’ ^N + 1 * Then let, say. 



A 



N,N+1 



A 



N,N+1 



and 



^N,N+1 ' ^N,N+1 ^N+1 * 



(5.1) 
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We get successive values in a similar way. When we have A 



N,N + j 



we integrate to obtain A 



N,N+j+l 



, a first estimate of A 



N,N+j+l • 



Then we add , to get 



A 



N,N+j+l 



A 



N,N+j+l 



+ X 



N+j+1 • 



(5.2) 



With the initial value Aj^ = Aj^, the sequence of predictions. 



or forecasts, is defined. 

In the initial part of the study, a simple integration 
routine was used for predicting, Aj^ n+M’ ^ "lodel using the non- 
divergent barotropic vorticity equation. When this model is used, 
the method of correcting above, alternately adjusting and 
integrating in feasible and practical. 

When a more accurate and complicated weather prediction 
model is used, serious difficulties arise with this method. 
Restarting a time integration after rather arbitrary adjustments 
to one or more variables may generate spurious gravity waves which 
degrade the prediction and offset the adjustments. To eliminate 
this difficulty a second method may be used, in which a separate 
series is generated for each interval of prediction. The appro- 
priate term is used for each interval and the adjusted functions 
are never integrated. 

5.2 Method 2 

There is another method which gets around the above problem, 
at the expense of increasing the storage and computation by a 
factor of roughly M, if we use M different intervals of prediction. 
That is, if we wish, say to adjust the 12-, 24-, and 36-hour 
forecasts we must, generate three time series. 



Now assume that we wish to predict ahead M steps. At the 



time t|^ we will have the predictions based on integration, 

^N,N+1’ ^N,N+M " ^^N,N+m^m = l ,M 

For the corrections we will save M sequences 

\ • ''n-m.n' ^ ' ' • ' ' " = ”'*1 • ■■■. N. 

(5.4) 



Let us consider any particular value of m. To generate the data 

for the time series we must store each time the uncorrected 

predictions A Then at time t we will obtain and store 

n , n+m n+m 

from (5,4). 

We will thus have M time series to be analyzed. Each is 

analyzed as discussed earlier. In this case however we will use 

a single prediction from each one: from we generate the single 

correction for m time steps ahead. If we denote the corrections 
^ m 

by the modified values will be 
n 



A = A + X 

n , n+m n , n+m n 



(5.5) 



This method has the advantage that the modified terms are not 
integrated. It has the disadvantage that it requires one time 
series for each value of m used, which leads to larger storage 
and computational requirements. 
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COMMENTS 



6 . 

The problem here differs from the most common applications of 
time series. We have a rather small amount of information which 
tends to be masked by a large random element. From this we are 
trying to predict transients, or trends, for relatively short 
periods, perhaps one to ten time steps. We are not directly 
interested in the smoothed values, since we are not concerned with 
what has happened, except as our ability to fit it reflects on our 
ability to predict. Our feeling is that the variables we are 
predicting may change rather rapidly so that the weather two weeks 
ago is of little interest; even the equation which governs the 
behavior may well have changed. We cannot really make use of 
long-term observations, as for a stable system, which allow more 
accurate predictions. 

We are trying to predict the short-range behavior of a part 
of a non-linear system of high order by solutions to a linear 
system of low order. Whether we try to do this by a difference 
equation or by Taylor series depends on the type of data, and the 
nature of the expected solutions. In our case both the form of 
the data and the anticipated periodic properties of the expected 
solutions suggest difference equations. 

The papers by Jones (1963), (1964) particularly, and (1965) 
suggested the general problem here. These treat more the 
descriptive and the long-range forecasting problem. The series 
are considered to be stationary in the sense that all observations 
are weighted equally; several formulas are then simplified. 
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The references found most useful in this study were Whittle 
(1963) and Gelb (1974). Recent developments in the use of spherical 
harmonics in meteorology (see 6ARP, 1974) and Fast Fourier Transforms 
make the methods appear feasible. 

The method is clearly that of least squares. It is also a 
linear regression method; all of the equations to be solved are 
linear. If we were to use, say, our estimates , Xj^ in a 

routine to try to improve the C's, then it would be non linear. 

The solutions to the difference equations are basically complex 
exponentials, that is, a combination of real exponentials and 
trigonometric functions. 

There are a number of points that should be discussed or 
clarified. 

6 . 1 Co ns i s tency 

It should be pointed out that even for a stationary system 
the solutions are not consistent, as follows. Let us assume that 
we have a solution to a difference Eq . (3.1) to which random 
uncorrelated terms y , each with expected value 0 and expected 
square a is added. Then Eq . (3.4) will not yield the desired 
coefficients in (3.2). The expected elements of the matrix of 
coefficients Q in (3.5), from (3.4) will be 



^i ,j ,N 



= E 






X . + 6 . . 0 

n-j 1 j 



> 2 > 



i ,j=l , 



( 6.1 ) 



here I 2 is the second-order identity matrix, and 6 is the Kronecker 

6, (=1 when i=j and 0 when i ^ j ) . That is, the Q matrices have an 

extra term a on the main diagonal. This suggests that if we have 
2 

a measure of a we could subtract it from the elements on the main 
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diagonal of Q, in a sort of negative ridge-regression scheme. 

This is very risky procedure however, since it drives the matrix 
of coefficients toward singularity. Difficulty has been 
encountered several times because this matrix Q was singular, and 
the danger is aggravated when exponential weighting with a large 
decay rate is used. 

6 . 2 Best Order for the Difference Equation 

We have suggested that we use a second-order difference 
equation, p = 2. Some of the reasons follow. Let us consider a 
system which is time independent, and assume that have found the 
correct order p and the true coefficients C in the difference 

2 

equations. In this case the residuals yield an estimate for a . 
Now consider the residual 



R 



N 

E 

P+1 



n n 






j 



+ 




since the x's satisfy (3.2). The expected value of the residual 
is then 

E(R) = (e w ) (2 + E C^ . .) a^, 

where the last term in the parenthese denotes the sum of the 
squares of all of the components of all of the C's. 

Now if we try to fit the solutions to a second-order system 
by a third-order difference equation, in the absence of noise, the 
matrix of coefficients is singular, since the solutions satisfy 
many third-order difference equations. If we have a little noise 



we may have a near-singular matrix and have large values for the 
^i»j»k’ ® large expected value in (6,1) for 

the residual. Limited numerical checks make the second-order 
equation appear satisfactory. 

6 . 3 Determining Initial Values 

There was no particular reason for choosing ten values to fit 
when we determine the starting values for prediction. It was felt 
that six would be the smallest number to be considered, that fewer 
would make the methods vulnerable to random errors in the last 
terms. Many more than ten could cause roundoff errors. 

6.4 Criteria for Adjusting Terms 

We do not expect to adjust or correct all of the coefficients. 
For some the errors may be too small to warrant the time and 
trouble. Other coefficients may contain such random elements and 
change so unpredi ctabl y that trends are effectively obscured. The 
decision to adjust or correct will probably depend on an analysis 
of similar sequences in the past. There are two basic ways to do 
this, one based on a long sequence of past data (that is, on 
climatology), and the other on recent data (corresponding to recent 
weather ) . 

We may analyze a long sequence for a coefficient, perhaps over 
a period of years, and from this make a decision, beforehand 
always to correct, or never, in the program. 

An alternate way is to see how well the smoothed values fit 
the last ten corrections. We generate these values naturally in 
determining the starting values for prediction, so this entails 
little computation. We may consider, for example, the ratio 
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10 , 10 . 

\ «n(^N-10+n - «n^N-10 + n ' 

If this is much less than one we have fitted the past ten values 
well, and if it exceeds one we have no skill in fitting the 
observations. The factor 1.25 adjusts for the fact that we could 
fit two terms exactly with the initial conditions. We might 
correct or not accordingly as this number was below or above some 
criterion, say, .75. The philosophy is that if we can fit a 
sequence of observations well, then we can predict well. (More 
logical is the conversejif we cannot fit the observed values, then 
we cannot predict well.) It is probably not worthwhile to use 
this criterion unless we usually correct. 

6 . 5 Criteria for Improvement 

In the numerical study we used two principal criteria for 
improvement. Both of these compared in some way the magnitude of 
the errors in prediction with adjustment with the errors generated 
without the adjustment. In the first criterion, the error in the 
predicted values(s) was compared with the rms value of the last 
ten corrections (without adjustment). This figure was calculated 
as the predictions were under way. For the second criterion, a 
long sequence of adjustments were calculated. Then the rms values 
of the errors in prediction after adjustment was compared with the 
rms value of the error in prediction without adjustment: 

eIz - X l^/2:|z 1^; 

' n n ' ' n ' ’ 

the range of summation of n is over each term for which a correc- 
tion to z was made. These measures are easily made in the machine, 
and they define a simple norm for the error relation. 
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It is not clear these norms are good ones from a meteorological 
point of view. Very likely the ultimate test will rest on maps and 
criteria for judging them. 

6.6 Real Coefficients 

Most of the coefficients are complex. The one associated with 
the modes that are independent of longitude (meridianal) are real 
and hence one dimensional. The routines developed then involve 
scalar equations, rather than two-dimensional vectors. The 
corresponding spherical harmonics are the Legendre polynomials, 
the simplest of the Legendre functions. 

7. RESULTS 

The procedure for analyzing the time series was checked on 

several sets of made-up data, and on two coefficients ^4 

1 2 

(of P-| and ? 2 ^ ^ using a difference equation of second order, p = 2. 
For data made up of solutions to a difference equation, with 
random noise added, the solution could be recovered if the noise 
was not too large, but the routine did not perform very well when 
the energy in the noise (rms value) was larger than in the solution. 

For meteorological data, on a string of 60 pairs of correc- 
tions, the predictions x.|.| to led to a definite improvement for 

one coefficient. The best value for the weighting w of the new 
values seemed rather small: the values .025 , .05 and .1 gave 

similar results and were better than larger values. The numbers 
below were determined as follows. We considered the coefficient 
of p| . A set of 10 data points were used at the start to get 
coefficients for the best difference equation and the initial 



18 



values. Five values were predicted, corresponding to the 12-hour 

correction to be made in 12 hours, 24 hours, ..., 60 hours. The 

error in predicting these was then calculated and normalized by 

dividing by the rms of the 10 original data. Then the time index 

was advanced by one and the operation repeated. 

The numbers in row one are the rms values of a , - a 

n+1 n , n + 1 

normalized by the rms values of z„ z„ q, ..., z„. The numbers 

n “ y n - o n 

in row m denote the rms value of a„,„ - a„^„ , normalized by 

n+m n+m- 1 , n +m •’ 

the rms value of z •••> z . Each is then a correction to be 

n- y n 

applied after integrating one time step. Each entry comes from 
the sum of 46 terms . 



No . of 
Time Steps 


Wei ght w 


.025 


.05 


.1 


.2 


.4 


1 






.6239 


.6257 


.61 31 


.6404 


.81 38 


2 






.6370 


.6401 


.631 2 


.6629 


.8640 


3 






.6670 


.6751 


.6834 


.7599 


1 .3095 


4 






.6573 


.6698 


.6960 


.8314 


1 .5523 


5 






.651 3 


.6636 


.6880 


.8496 


1 .91 09 


There 


are several interesting 


po i n ts 


about the data. First 


the results 


from 


the smal 1 


val ues 


of w, from .025 


to . 1 


are all 


compa rabl e . 


This 


indicates 


a stable trend 


: when 


w = . 


025 the 



first of the 10 data points had a weighting almost .8 of the last; 
they are weighted nearly equally. For w = .10 the relative 
weighting is about .39. A surprising feature is that all five of 
the predictions are so similar. This is further indication of a 
stable trend, a large "random" element, and a satisfactory value 
of p, and suggests longer predictions are feasible. 
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The corrections to the fourth coefficient were not predicted 
successfully; the error in the predicted values was consistently 
a little large than the corrections. However various features of 
the solutions are quite similar to the other solutions; the 
smaller values of w give better predictions and all five predicted 
values are similar in size, with later predictions often anomalously 
having smaller error than the first. 
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SYMBOLS: DEFINITIONS, AND PLACES WHERE INTRODUCED 



Section 2 

t^, time N is usually associated with the last 

or next-to-last observation. 



A 



n 



^ k , n 



the matrix, or vector, of coefficients for 
the analyzed field at time t^. 

the k'th element of A^ , k=l , K. Most 

of the a. are complex numbers and are 
K 5 n 

treated like two-dimensional vectors. 



K 



a 



n 



(=a 



k 




the number of spherical harmonics used to 
represent a function. 

a typical element of A^, written without 
index k to save writing. 



^n ,n.| 

^n~^n”^n-l ,n 



the value predicted for A at time t . 

n 1 n 

This represents the discrepancy between the 

analyzed field at t^ and the value predicted 

for it at time t ,. Also called a correction. 

n- I 



Section 3 



z 



n 



y 



n 



a typical element of Z^. 

the component of z^ that is assumed to be 
random . 



R 

w 

w 

n 



the component of z^ that is assumed to satisfy 
a difference equation (see below). 

two by two matrices in the difference equation 

X =C,x ,+***+Cx 
n 1 n-1 p n-p 

weighted sum of residual errors. 



weighting of new terms. 

the weighting of term associated with t^, at 

time N^n;w decreases with N for a fixed 

^ " n \N-n 

val ue 0 f n: w = w ( 1 -w ) 
n 
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Section 3 (continued) 



X a best approximation for x^, made at some 

definite time tj^. (It might also have been 
designated by Xj^ ^ ^.) 

the vector, or matrix, of 

Section 4 

A , B matrices defined in Eq. (4.2), used in 

n n 

finding x ' s . 

^ n 

d" = (A^,BJ (see Eqs . (4.3), (4.4)). 

X.. the i'th component of x., i,j=l,2. 

' J J 

U “ (x.|2,X22>Xi.j *x*|2^ 



an optimal predictor for made at tj^ . 

a temporary estimate of i\|+m’ ^ 

obtaining 

A 

n-m ,n 

typical elements of z'^ . 

R a general symbol for a quadratic residual 

to be mi ni mi zed . 



Section 5 

,N+m 

,N+m 
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